Dynamics of Boolean Networks with Scale-Free Topology 
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The dynamics of Boolean networks (the N-K model) with scale-free topology are studied here. 
The existence of a phase transition governed by the value of the scale-free exponent of the network 
is shown analytically by analyzing the overlap between two distinct trajectories. The phase diagram 
shows that the phase transition occurs for values of the scale-free exponent in the open interval 
(2,2.5). Since the Boolean networks under study are directed graphs, the scale-free topology of 
the input connections and that of the output connections are studied separately. Ultimately these 
two topologies are shown to be equivalent. An important result of this work is that the fine-tuning 
usually required to achieve stability in Boolean networks with a totally random topology is no longer 
necessary when the network topology is scale- free. 
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The study and characterization of the statistical prop- 
erties of complex networks has received renewed atten- 
tion in the last few years ^ In particular, it has 
been recently shown by A. L. Barabasi, M. Newman, 
R. Sole and many other authors that a great variety 
of real networks exhibit a scale-free topology, including 
the WWW and the Internet |Q 3], social networks pi] , 
metabolic and protein networks prf |l2| 1^1 , ecological 
networks [Q and genetic networks |£, |l7|, to men- 
tion just a few examples (for more references see |2| [|). 
A scale-free topology means that the probability P[k) 
that an arbitrary element of the network is connected 
to exactly k other elements has the form P(k) — Ck^ 1 ', 
where 7 is usually called the scale- free exponent. Scale- 
free networks have the key property that a small fraction 
of the elements are highly connected whereas the major- 
ity of the elements are poorly connected. The ubiquity 
of scale-free networks has led to a systematic study of 
the structural properties that characterize the wiring di- 
agram of the network. Nevertheless, the dynamics gener- 
ated by a scale-free network topology when the elements 
are provided with some kind of dynamic interaction rule, 
remain essentially unexplored. 

An interesting dynamical network in which the scale- 
free topology has important implications is the N-K 
model proposed by Stuart Kauffman in 1969 to describe 
generically the dynamics involved in the processes of gene 
regulation and cell differentiation |0. In this classic 
model, the genome of a given organism is represented by 
a set of N genes, each being a binary variable describ- 
ing the two possible states of gene-expression: either the 
gene is expressed (1) or it is not (0). Since the expression 
of a gene is controlled by the expression of some other 
genes, Kauffman assumed the genome as a directed net- 
work in which a link from a given gene A to another gene 
B means that A controls the expression of B. In view of 
the complexity exhibited by real genetic networks, Kauff- 
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man made three simplifying assumptions: (a) every gene 
is connected (is controlled) by exactly K other genes; (b) 
the K genes to which every gene is connected are chosen 
randomly with uniform probability from the entire sys- 
tem; (c) each gene is expressed with probability p and is 
not expressed with probability 1 — p, depending upon the 
configurations of its K controlling genes. 

Even with these simplifying assumptions, a very rich 
and unexpected behavior of the model was found (for 
references see ||). In particular, in 1986 Derrida and 
Pomeau showed analytically the existence of a dynami- 
cal phase transition controlled by the parameters K and 
p ||. For every value of p there exists a critical value 
of the connectivity, K c (p) = [2p(l — p)]^ 1 , such that if 
K < K c (p) all perturbations in an initial state of the 
system die out (ordered phase). For K > K c {p) a small 
perturbation in the initial state of the system propagates 
across the entire system over time (chaotic phase). Ac- 
cording to Kauffman, Stauffer, and other authors, only 
when K = K c (p) (the critical phase) does the N-K model 
have the required stability properties compatible with the 
order manifest in the genetic networks of living organisms 
JTi] , pp[ . This fact made Kauffman coin the term "life at 
the edge of chaos" . 

Although the N-K model qualitatively points in the 
right direction, it fails to account for a quantitative de- 
scription of what actually is observed in genetic networks. 
One of the main problems is that the critical connectiv- 
ity K c (p) is very small for most values of p (see Fig. [I]). 
In contrast, it is well known that real genetic networks 
exhibit a wide range of connectivities. For example, the 
expression of the human /3-globine gene (expressed in red 
blood cells) or the even-skipped gene in Drosophila (play- 
ing an important role in the development of the embryo) , 
are each controlled by more than 20 different regulatory 
proteins [Q . Analogously, both the the fibroblast growth 
factor (FGF) activation and the platelet-derived growth 
factor (PDGF) activation in mammalian cells, result in 
the cascade activation of over 60 other genes 0. On 
the other side of the spectrum is the lac operon in E. 
coli, which is regulated by only two proteins: the lac re- 
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FIG. 1: Phase diagram for the standard N-K model. The 
solid curve is the critical connectivity K c as a function of 
p. The shaded area represents the ordered phase (K < K c ) 
whereas the white area represents the chaotic phase (K > 
K B ). 

pressor protein and the catabolite activator protein [Q. 
These observations are contrary to the Kauffman model 
since high connectivities imply either chaotic behavior or 
almost constant boolean functions (p very close to or 1). 
To achieve stability with moderate high connectivities in 
the N-K model, it is necessary to fine-tune the value of 
p. For instance, in order to have ordered dynamics when 
the critical connectivity is K c = 20, the value of p should 
be in the interval < p < 0.026 (or 0.974 < p < 1). 
As far as we can tell, there is neither a theoretical nor 
an experimental reason justifying why the parameters K 
and p should "live" in the shaded area of Fig. |l|. 

The above suggests that the random network topology 
upon which the Kauffman model is based, is inadequate. 
In view of the ubiquity of scale-free networks and of the 
fact that some genes in real genetic networks are highly 
connected whereas others are not, it is reasonable to re- 
place assumptions (a) and (b) mentioned above by the 
assumption that the connectivity k of every gene in the 
network follows a scale-free distribution, P(k) ~ fc~ 7 . 
It was not until very recently that the dynamics of the 
N-K model with a scale-free topology were studied nu- 
merically || [l7|. Nevertheless, the values of 7 and p at 
which the phase transition occurs, if it does occur at all, 
were unknown in these previous works and the results 
may have to be reinterpreted. In this letter we show an- 
alytically that the N-K model with a scale-free topology 
undergoes a phase transition controlled by the scale-free 
exponent 7 and the parameter p, and provide the phase 
diagram that fully identifies the ordered, critical, and 
chaotic phases. 

The model which we will be working with is the follow- 
ing. The network is represented by a set of N boolean 
variables (or elements), {ci, 02, . . . , <Jn}- Each element 
Cj is controlled by ki other elements of the network, 
where ki is chosen randomly with probability Pj(ki). 
Let {(Ti 1 , . . . , <7i fe . } be the set of the controlling elements 



FIG. 2: Plot of the mapping F(x) as a function of x for 
a scale-free distribution Pz(k). The solid curve corresponds 
to 7 = 3, for which lim^!- dF(x)/dx < 1. The only fixed 
point in this case is x = 1. The dashed curve corresponds to 
7 = 1.5, for which \im x _ 1 - dF(x)/dx = 00. In this other case 
there is a fixed point x* 7^ 1. The dashed-dotted line is the 
identity. 

of Ci. We then assign to each cr* a boolean function 
/((Tiu . . . ,cri h .) such that for each configuration of the 
controlling elements, /, = 1 with probability p and 
fi = with probability 1 — p. Once the controlling 
elements and the boolean functions are assigned to ev- 
ery element in the network, the dynamics of the sys- 
tem is given by Oi{t + 1) = fi(a h (t), a ik . (t)). We 
denote as E t the state of the entire system at time t, 
S t = {ai(t),a 2 (t), . . . ,a N (t)}. 

To show the existence of the phase transition we con- 
sider the overlap x(t) between two distinct configurations 
E t and E t (the temporal evolution of these two configu- 
rations is governed by the same set of boolean functions). 
The overlap is defined as the fraction of elements in both 
configurations that have the same value, and is given by 

1 N 

x{t) = i--Y d Wi{t)-~<t)\- (i) 

i=l 

If Et and S t are totally independent, then x(t) rj 0.5 
whereas if they are almost equal x(t) ~ 1. In the limit 
N — > 00 the overlap becomes the probability for two 
arbitrary but corresponding elements, <Tj(t) G S t and 
di{t) S St, to be equal. The stationary value of the over- 
lap, defined as x = lim^oo x(t), can be considered as 
an order parameter of the system. If x = 1, the sys- 
tem is insensitive to initial perturbations (all differences 
between configurations die out over time). In this case 
the system presents an ordered behavior. On the other 
hand, if x ^ 1, the initial perturbations propagate across 
the entire system and do not disappear. In this case the 
system exhibits a chaotic behavior. 

By generalizing the annealed computation carried out 
by Derrida and Pomeau in Ref. |i| to the case in which 
each element receives k inputs with probability -Pj(fc), 
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one finds that the overlap obeys the dynamical equation 

x(t + 1) = F (x(t)) , (2a) 
where the mapping F(x) is given by 

oo 

F{x) = l-2p{\-p)[l-^ k Pi{k)}. (2b) 

k=l 

In the limit t — > oo, Eq. (||) becomes the fixed point 
equation x = F(x) for the stationary value of the over- 
lap. Note that x = 1 is always a fixed point of Eq. (||). 
Nonetheless, this solution may be stable or unstable de- 
pending on Px(k). Note also that F(x) is a monotonically 
increasing function of x with the property that F (0) = 
1 - 2p(l -p) and F(l) = 1. Therefore, Eq. (|) will have 
a stable fixed point x* ^ 1 only if linXj.^- dF(x)/dx > 1 
(chaotic phase). In contrast, if linx,,^- dF(x)/dx < 1 
the only fixed point is x = 1 (ordered phase). The 
situation is illustrated in Fig. ^|. The phase transition 
between the ordered and chaotic regimes occurs when 
lim^x- dF(x)/dx = 1. From Eq. (§t| it follows that 



lim 



dF{x) 
dx 



2p(l-p)J2kP I (k) 



k=l 



2p(l-p)(k) x , 



(3) 



where (k)j — Ylk=x kPx(k) is the first moment of Px(k). 
The phase transition is then determined by the condition 



2p(l-p)(k) x = l. 



(4) 



In the standard N-K model all the elements have the 
same connectivity, fcj = (k)x — K, and Eq. (jij) reduces 
to the result obtained by Derrida and Pomeau in Ref . M . 
(Luque and Sole derived Eq. (|J) in Ref. |l5) by consid- 
ering the set of relevant elements of the system — the el- 
ements that do not reach a constant value over time.) 

It is interesting to note that the phase transition is 
governed only by the first moment of Px(t). Neverthe- 
less, for the scale-free distribution Px(k) = Ck~ 7 , the 
first moment is not necessarily a meaningful parameter 
to characterize the network topology. For instance, if 
2 < 7 < 3, the second moment of the distribution is 
infinite even when the first moment has a finite value, 
which means that the fluctuations around the first mo- 
ment are much larger than the first moment itself. There- 
fore, rather than characterizing the phase transition us- 
ing the first moment of the distribution, we will do it 
by means of the scale-free exponent 7, which is the only 
natural parameter that determines the network topology. 

For the scale-free distribution to be normalizable, it 
is necessary to have 7 > 1. Under such conditions, the 
probability function Px{k) is given by 
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FIG. 3: Phase diagram for the N-K model with a scale- 
free network topology. The solid line is the value of y c as a 
function of p given by the transcendental equation (hj). The 
white and shaded areas represent the chaotic phase (7 < y c ) 
and the ordered phase (7 > 7 C ), respectively. 



where C(7) = J2T=i ^ 7 ^ s ^ ne Riemann Zeta function. 
The first moment of this distribution is then expressed 
as a function of 7 as, 



(kh 



oc 



C(7~l) 



if 1 < 7 < 2 
if 7 > 2 



(6) 



From Eqs. (||)-(||), it follows that the fixed point x — 1 
is unstable if 1 < 7 < 2. In this case Eq. (^) has a stable 
fixed point x* ^ 1, and the system is in the chaotic phase 
for any value of p in the open interval (0,1). The only 
way in which the overlap between two distinct configura- 
tions can converge to 1 is if all the boolean functions are 
constant, namely, if either p = or p = 1. On the other 
hand, when 7 > 2 the first moment of the distribution is 
finite. In this case, the value 7 C of the scaling-free expo- 
nent at which the phase transition occurs is determined 
by the transcendental equation, 



2P(1-P)%^ = 1. 

C(7c) 



(7) 



The values j c and p for which this equation is satisfied 
are plotted in Fig. ||. As can be seen, 7 C e [2, 2.5] for 
any value of p. Actually, j c reaches its maximum value 
^max w 2.47875 at p = 0.5. Above this maximum value 
the system is always in the ordered phase regardless of 
the value or p. It is interesting to note that 7 6 [2,3] 
for the majority of the real scale-free networks that have 
been analyzed Q. 

An important characteristic of the N-K model is that 
it is a directed graph: if Oi is an input of cij, the op- 
posite does not necessarily occur. Every element cj{ is 
regulated by ki elements. But Oi can in turn regulate 
the value of a number of other elements, say li. The 
distribution Pj(fc) of input connections is not necessarily 
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equal to the distribution P (l) of output connections. Up 
to now we have assumed that Px(k) is a scale- free dis- 
tribution. But as the example of the fibroblast growth 
factor (FGF) suggests, it is also possible for the network 
of output connections to present a scale-free topology. To 
apply our previous results to the case in which P (l) is 
known instead of Pi(k), we should find how these two 
distributions are related. 

Suppose that an arbitrary element <7j of the network 
has U outputs with probability P (h) — I, T 1 /C{l)- (We 
assume that 7 > 2 so that the first moment of the dis- 
tribution is well defined.) When the k outputs of every 
element are chosen randomly with uniform probability 
from the entire system, the input probability distribu- 
tion Pr(k) is given by 

™-G9(%)>%n. <*> 

where (l) a is the first moment of P (l). In the limit 
N — > 00, the above expression transforms into the Pois- 
son distribution: 

Px(fc) = e-^Ii%£. ( 9 ) 

This result shows that the fist moment of the input dis- 
tribution Px{k) is equal to the first moment of the output 
distribution P (k), {k) T = (l) . Therefore, Eq. (Q) and 
the phase diagram shown in Fig. are still valid if we 
substitute (l)x by (l) . 

A consequence of the scale-free network topology is 
that the constraint of having a very low connectivity for 



every element (or a high connectivity with almost con- 
stant functions) in order to be in the ordered phase, is no 
longer valid. The scale-free topology eliminates the ne- 
cessity of fine-tuning the parameters K and p to achieve 
stable dynamics. Furthermore, the fact that the phase 
transition occurs for values of 7 in the interval (2,2.5), 
allows the existence of elements with a wide range of 
connectivities in the critical and ordered phases, which is 
required to describe the observed behavior of real genetic 
networks. Now that the human genome, as well as the 
genome of some other organisms, have been thoroughly 
sequenced, it will be possible to determine experimen- 
tally if the topology of real genetic networks is scale-free 
or not p2| . More work is called for to fully characterize 
the statistical properties of Boolean networks with scale- 
free topology, such as the distribution of the number of 
different orbits, the distribution of the orbit lengths, and 
the stability of the dynamics in the three different phases 
of the N-K model. We believe that the phase diagram 
shown in Fig. p|will be a useful guide for further studies. 
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